NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

To CoT or not To CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Sprague, Zayne; Yin, Fangcong; Rodriguez, Juan Diego; Jiang, Dongwei; Wadhwa, Manya; Singhal, Prasann; Zhao, Xinyu; Ye, Xi; Mahowald, Kyle; Durrett, Greg (May 2025, Proceedings of the International Conference on Learning Representations)

Free, publicly-accessible full text available May 1, 2026
Models Can and Should Embrace the Communicative Nature of Human-Generated Math

Boguraev, Sasha; Lipkin, Ben; Weissweiler, Leonie; Mahowald, Kyle (December 2024, Math-AI Workshop, NeurIPS)

Full Text Available
Is It JUST Semantics? A Case Study of Discourse Particle Understanding in LLMs

https://doi.org/10.18653/v1/2025.findings-acl.1117

Sheffield, William Berkeley; Misra, Kanishka; Pyatkin, Valentina; Deo, Ashwini; Mahowald, Kyle; Li, Junyi Jessy (January 2025, indings of the Association for Computational Linguistics: ACL 2025)

Discourse particles are crucial elements that subtly shape the meaning of text. These words, often polyfunctional, give rise to nuanced and often quite disparate semantic/discourse effects,as exemplified by the diverse uses of the particle *just* (e.g., exclusive, temporal, emphatic). This work investigates the capacity of LLMs to distinguish the fine-grained senses of English *just*, a well-studied example in formal semantics, using data meticulously created and labeled by expert linguists. Our findings reveal that while LLMs exhibit some ability to differentiate between broader categories, they struggle to fully capture more subtle nuances, highlighting a gap in their understanding of discourse particles.
more » « less
Full Text Available
Are Language Models More Like Libraries or Like Librarians? Bibliotechnism, the Novel Reference Problem, and the Attitudes of LLMs

Lederman, Harvey; Mahowald, Kyle (September 2024, TACL)

Full Text Available
Do *they* mean ‘us’? Interpreting Referring Expression variation under Intergroup Bias

https://doi.org/10.18653/v1/2024.findings-emnlp.571

Govindarajan, Venkata S; Zang, Matianyu; Mahowald, Kyle; Beaver, David; Li, Junyi Jessy (November 2024, Findings of the Association for Computational Linguistics: EMNLP 2024, Association for Computational Linguistics)

The variations between in-group and out-group speech (intergroup bias) are subtle and could underlie many social phenomena like stereotype perpetuation and implicit bias. In this paper, we model intergroup bias as a tagging task on English sports comments from forums dedicated to fandom for NFL teams. We curate a dataset of over 6 million game-time comments from opposing perspectives (the teams in the game), each comment grounded in a non-linguistic description of the events that precipitated these comments (live win probabilities for each team). Expert and crowd annotations justify modeling the bias through tagging of implicit and explicit referring expressions and reveal the rich, contextual understanding of language and the world required for this task. For large-scale analysis of intergroup variation, we use LLMs for automated tagging, and discover that LLMs occasionally perform better when prompted with linguistic descriptions of the win probability at the time of the comment, rather than numerical probability. Further, large-scale tagging of comments using LLMs uncovers linear variations in the form of referent across win probabilities that distinguish in-group and out-group utterances.
more » « less
Full Text Available
Language models align with human judgments on key grammatical constructions

https://doi.org/10.1073/pnas.2400917121

Hu, Jennifer; Mahowald, Kyle; Lupyan, Gary; Ivanova, Anna; Levy, Roger (September 2024, Proceedings of the National Academy of Sciences)

Full Text Available
Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs

https://doi.org/10.18653/v1/2024.emnlp-main.53

Misra, Kanishka; Mahowald, Kyle (January 2024, Association for Computational Linguistics)

Full Text Available
Mission: Impossible Language Models

https://doi.org/10.18653/v1/2024.acl-long.787

Kallini, Julie; Papadimitriou, Isabel; Futrell, Richard; Mahowald, Kyle; Potts, Christopher (January 2024, Association for Computational Linguistics)

Full Text Available
Dissociating language and thought in large language models

https://doi.org/10.1016/j.tics.2024.01.011

Mahowald, Kyle; Ivanova, Anna A; Blank, Idan A; Kanwisher, Nancy; Tenenbaum, Joshua B; Fedorenko, Evelina (March 2024, Trends in Cognitive Sciences)

Full Text Available
Elaborative Simplification as Implicit Questions Under Discussion

https://doi.org/10.18653/v1/2023.emnlp-main.336

Wu, Yating; Sheffield, William; Mahowald, Kyle; Li, Junyi Jessy (December 2023, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing)

Automated text simplification, a technique useful for making text more accessible to people such as children and emergent bilinguals, is often thought of as a monolingual translation task from complex sentences to simplified sentences using encoder-decoder models. This view fails to account for elaborative simplification, where new information is added into the simplified text. This paper proposes to view elaborative simplification through the lens of the Question Under Discussion (QUD) framework, providing a robust way to investigate what writers elaborate upon, how they elaborate, and how elaborations fit into the discourse context by viewing elaborations as explicit answers to implicit questions. We introduce ELABQUD, consisting of 1.3K elaborations accompanied with implicit QUDs, to study these phenomena. We show that explicitly modeling QUD (via question generation) not only provides essential understanding of elaborative simplification and how the elaborations connect with the rest of the discourse, but also substantially improves the quality of elaboration generation.
more » « less
Full Text Available

« Prev Next »

Search for: All records